Overview
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 4500 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.9 MiB |
| Average record size in memory | 914.1 B |
Variable types
| Text | 6 |
|---|---|
| Numeric | 3 |
| DateTime | 1 |
| Categorical | 9 |
Cluster is highly overall correlated with PatientIncome | High correlation |
PatientIncome is highly overall correlated with Cluster | High correlation |
ClaimLegitimacy is highly imbalanced (67.3%) | Imbalance |
ClaimID has unique values | Unique |
PatientID has unique values | Unique |
ProviderID has unique values | Unique |
PatientIncome has unique values | Unique |
Reproduction
| Analysis started | 2026-01-11 18:21:54.022805 |
|---|---|
| Analysis finished | 2026-01-11 18:21:56.660254 |
| Duration | 2.64 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
ClaimID
Text
Unique
| Distinct | 4500 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 373.7 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 4500 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 4d76c7f7-d36a-4139-b451-a9a4ad10d7d5 |
|---|---|
| 2nd row | e35193b4-3609-492b-866a-98de19317e9c |
| 3rd row | 1f3fa373-25ed-4ff4-b6c7-38dcb2fb297f |
| 4th row | af6a68f4-8319-47b1-a28b-77de01572851 |
| 5th row | 417fe944-79d2-4610-81c4-a2d496f29ee4 |
| Value | Count | Frequency (%) |
| 417fe944-79d2-4610-81c4-a2d496f29ee4 | 1 | < 0.1% |
| 291cfa64-9956-40e7-b89f-4628650f42f0 | 1 | < 0.1% |
| 4d76c7f7-d36a-4139-b451-a9a4ad10d7d5 | 1 | < 0.1% |
| e35193b4-3609-492b-866a-98de19317e9c | 1 | < 0.1% |
| 1492c9c7-e184-413d-b951-f4377400782f | 1 | < 0.1% |
| a1684758-40b1-4f1d-8f5b-409c7228dbac | 1 | < 0.1% |
| 2c2ed3f4-90c6-4681-94b4-d20278d85963 | 1 | < 0.1% |
| 379d7c46-3096-4741-9d42-26f540347070 | 1 | < 0.1% |
| ab6b425f-957e-4448-a715-97d8aabddb6d | 1 | < 0.1% |
| e1464b6a-4ea4-4fa1-952d-e16ebdd032c5 | 1 | < 0.1% |
| Other values (4490) | 4490 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12975 | 8.0% |
| 8 | 9645 | 6.0% |
| a | 9608 | 5.9% |
| b | 9564 | 5.9% |
| 9 | 9537 | 5.9% |
| 6 | 8529 | 5.3% |
| f | 8511 | 5.3% |
| e | 8473 | 5.2% |
| 2 | 8464 | 5.2% |
| Other values (7) | 58694 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12975 | 8.0% |
| 8 | 9645 | 6.0% |
| a | 9608 | 5.9% |
| b | 9564 | 5.9% |
| 9 | 9537 | 5.9% |
| 6 | 8529 | 5.3% |
| f | 8511 | 5.3% |
| e | 8473 | 5.2% |
| 2 | 8464 | 5.2% |
| Other values (7) | 58694 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12975 | 8.0% |
| 8 | 9645 | 6.0% |
| a | 9608 | 5.9% |
| b | 9564 | 5.9% |
| 9 | 9537 | 5.9% |
| 6 | 8529 | 5.3% |
| f | 8511 | 5.3% |
| e | 8473 | 5.2% |
| 2 | 8464 | 5.2% |
| Other values (7) | 58694 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12975 | 8.0% |
| 8 | 9645 | 6.0% |
| a | 9608 | 5.9% |
| b | 9564 | 5.9% |
| 9 | 9537 | 5.9% |
| 6 | 8529 | 5.3% |
| f | 8511 | 5.3% |
| e | 8473 | 5.2% |
| 2 | 8464 | 5.2% |
| Other values (7) | 58694 |
PatientID
Text
Unique
| Distinct | 4500 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 373.7 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 4500 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 19cf2638-3ec0-4ed9-9995-d9ba4553813a |
|---|---|
| 2nd row | 5c4bb6c5-4dd3-4a86-85fa-f36c0d8debff |
| 3rd row | 777866e0-4d10-45a8-a7b4-dbdaa26d5a81 |
| 4th row | 9d7c53ee-eb1a-4f07-9e3a-e86cf82e9f0f |
| 5th row | db14b0ca-ac2a-4e83-b085-947ea32e7587 |
| Value | Count | Frequency (%) |
| db14b0ca-ac2a-4e83-b085-947ea32e7587 | 1 | < 0.1% |
| 2bd2d173-4ce1-428d-836c-259d9236a839 | 1 | < 0.1% |
| 19cf2638-3ec0-4ed9-9995-d9ba4553813a | 1 | < 0.1% |
| 5c4bb6c5-4dd3-4a86-85fa-f36c0d8debff | 1 | < 0.1% |
| fb07a807-4dcc-4e09-bea6-4ca54acf6add | 1 | < 0.1% |
| 638c3542-dc16-4507-95f8-a1bb0c425624 | 1 | < 0.1% |
| bce42931-4ff7-487b-b373-773bdb57241b | 1 | < 0.1% |
| 09a26428-831a-4d5f-bd9f-ee790468aae5 | 1 | < 0.1% |
| 67f76baf-3c23-45ee-8898-ec4a25c85e11 | 1 | < 0.1% |
| 72f46521-1f31-4707-bbd9-4760af6d9d5c | 1 | < 0.1% |
| Other values (4490) | 4490 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12931 | 8.0% |
| 8 | 9664 | 6.0% |
| 9 | 9593 | 5.9% |
| a | 9583 | 5.9% |
| b | 9565 | 5.9% |
| d | 8596 | 5.3% |
| e | 8508 | 5.3% |
| 5 | 8484 | 5.2% |
| 6 | 8474 | 5.2% |
| Other values (7) | 58602 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12931 | 8.0% |
| 8 | 9664 | 6.0% |
| 9 | 9593 | 5.9% |
| a | 9583 | 5.9% |
| b | 9565 | 5.9% |
| d | 8596 | 5.3% |
| e | 8508 | 5.3% |
| 5 | 8484 | 5.2% |
| 6 | 8474 | 5.2% |
| Other values (7) | 58602 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12931 | 8.0% |
| 8 | 9664 | 6.0% |
| 9 | 9593 | 5.9% |
| a | 9583 | 5.9% |
| b | 9565 | 5.9% |
| d | 8596 | 5.3% |
| e | 8508 | 5.3% |
| 5 | 8484 | 5.2% |
| 6 | 8474 | 5.2% |
| Other values (7) | 58602 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12931 | 8.0% |
| 8 | 9664 | 6.0% |
| 9 | 9593 | 5.9% |
| a | 9583 | 5.9% |
| b | 9565 | 5.9% |
| d | 8596 | 5.3% |
| e | 8508 | 5.3% |
| 5 | 8484 | 5.2% |
| 6 | 8474 | 5.2% |
| Other values (7) | 58602 |
ProviderID
Text
Unique
| Distinct | 4500 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 373.7 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 4500 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | a3d0cc80-dffe-40ff-a302-23c8ffeedb36 |
|---|---|
| 2nd row | a9f25acf-92b8-45e2-9cef-87bd07d0a591 |
| 3rd row | 951b1e08-9948-4956-80e5-9277f16bd290 |
| 4th row | de9e193a-f9a1-4d63-9345-aefe75694628 |
| 5th row | 5c7d7045-71b6-4c15-937c-43e4cfe65bf4 |
| Value | Count | Frequency (%) |
| 5c7d7045-71b6-4c15-937c-43e4cfe65bf4 | 1 | < 0.1% |
| cf84cf99-0ac3-465a-af90-239a873bafa5 | 1 | < 0.1% |
| a3d0cc80-dffe-40ff-a302-23c8ffeedb36 | 1 | < 0.1% |
| a9f25acf-92b8-45e2-9cef-87bd07d0a591 | 1 | < 0.1% |
| 4cbf206c-b046-40d6-b953-927d2ed77950 | 1 | < 0.1% |
| 20685b18-4e11-4a78-b714-1d5e70610385 | 1 | < 0.1% |
| 9e339ef9-c22f-4a42-b299-857fbbc1fa81 | 1 | < 0.1% |
| 4d2bab66-8b53-40b4-9724-c8c77522e8c3 | 1 | < 0.1% |
| 866b8799-1436-4c1b-ae34-37e1fda57c99 | 1 | < 0.1% |
| e84f0876-375d-40f4-a112-46c24ac59627 | 1 | < 0.1% |
| Other values (4490) | 4490 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12912 | 8.0% |
| 8 | 9676 | 6.0% |
| 9 | 9606 | 5.9% |
| a | 9544 | 5.9% |
| b | 9384 | 5.8% |
| f | 8602 | 5.3% |
| e | 8574 | 5.3% |
| 1 | 8547 | 5.3% |
| 5 | 8485 | 5.2% |
| Other values (7) | 58670 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12912 | 8.0% |
| 8 | 9676 | 6.0% |
| 9 | 9606 | 5.9% |
| a | 9544 | 5.9% |
| b | 9384 | 5.8% |
| f | 8602 | 5.3% |
| e | 8574 | 5.3% |
| 1 | 8547 | 5.3% |
| 5 | 8485 | 5.2% |
| Other values (7) | 58670 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12912 | 8.0% |
| 8 | 9676 | 6.0% |
| 9 | 9606 | 5.9% |
| a | 9544 | 5.9% |
| b | 9384 | 5.8% |
| f | 8602 | 5.3% |
| e | 8574 | 5.3% |
| 1 | 8547 | 5.3% |
| 5 | 8485 | 5.2% |
| Other values (7) | 58670 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 162000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 18000 | 11.1% |
| 4 | 12912 | 8.0% |
| 8 | 9676 | 6.0% |
| 9 | 9606 | 5.9% |
| a | 9544 | 5.9% |
| b | 9384 | 5.8% |
| f | 8602 | 5.3% |
| e | 8574 | 5.3% |
| 1 | 8547 | 5.3% |
| 5 | 8485 | 5.2% |
| Other values (7) | 58670 |
ClaimAmount
Real number (ℝ)
| Distinct | 4490 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5014.2039 |
| Minimum | 100.12 |
|---|---|
| Maximum | 9997.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.3 KiB |
Quantile statistics
| Minimum | 100.12 |
|---|---|
| 5-th percentile | 590.725 |
| Q1 | 2509.0725 |
| median | 5053.765 |
| Q3 | 7462.4525 |
| 95-th percentile | 9510.307 |
| Maximum | 9997.2 |
| Range | 9897.08 |
| Interquartile range (IQR) | 4953.38 |
Descriptive statistics
| Standard deviation | 2866.2911 |
|---|---|
| Coefficient of variation (CV) | 0.57163433 |
| Kurtosis | -1.2029995 |
| Mean | 5014.2039 |
| Median Absolute Deviation (MAD) | 2483.075 |
| Skewness | 0.00042447153 |
| Sum | 22563917 |
| Variance | 8215624.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7946.69 | 2 | < 0.1% |
| 9963.71 | 2 | < 0.1% |
| 8936.33 | 2 | < 0.1% |
| 6118.26 | 2 | < 0.1% |
| 6834.26 | 2 | < 0.1% |
| 6116.75 | 2 | < 0.1% |
| 8414.04 | 2 | < 0.1% |
| 4153.18 | 2 | < 0.1% |
| 862.1 | 2 | < 0.1% |
| 5540.34 | 2 | < 0.1% |
| Other values (4480) | 4480 |
| Value | Count | Frequency (%) |
| 100.12 | 1 | |
| 100.3 | 1 | |
| 101.33 | 1 | |
| 106.47 | 1 | |
| 111.01 | 1 | |
| 113.4 | 1 | |
| 114.59 | 1 | |
| 115.49 | 1 | |
| 119.72 | 1 | |
| 131.86 | 1 |
| Value | Count | Frequency (%) |
| 9997.2 | 1 | |
| 9995.62 | 1 | |
| 9994.2 | 1 | |
| 9989.04 | 1 | |
| 9983.64 | 1 | |
| 9979.55 | 1 | |
| 9978.43 | 1 | |
| 9977.72 | 1 | |
| 9976.52 | 1 | |
| 9972.66 | 1 |
ClaimDate
Date
| Distinct | 731 |
|---|---|
| Distinct (%) | 16.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.3 KiB |
| Minimum | 2022-07-09 00:00:00 |
|---|---|
| Maximum | 2024-07-08 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
DiagnosisCode
Text
| Distinct | 4495 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.4 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 4490 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | Ta150 |
|---|---|
| 2nd row | Fo766 |
| 3rd row | AX876 |
| 4th row | SQ441 |
| 5th row | FK970 |
| Value | Count | Frequency (%) |
| me712 | 2 | < 0.1% |
| zf797 | 2 | < 0.1% |
| tt251 | 2 | < 0.1% |
| rs522 | 2 | < 0.1% |
| yl726 | 2 | < 0.1% |
| ia775 | 2 | < 0.1% |
| ej032 | 2 | < 0.1% |
| xa248 | 2 | < 0.1% |
| vy109 | 2 | < 0.1% |
| ae034 | 2 | < 0.1% |
| Other values (4476) | 4480 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 1388 | 6.2% |
| 2 | 1387 | 6.2% |
| 7 | 1383 | 6.1% |
| 5 | 1382 | 6.1% |
| 9 | 1364 | 6.1% |
| 6 | 1354 | 6.0% |
| 3 | 1339 | 6.0% |
| 1 | 1316 | 5.8% |
| 8 | 1300 | 5.8% |
| 0 | 1287 | 5.7% |
| Other values (52) | 9000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4 | 1388 | 6.2% |
| 2 | 1387 | 6.2% |
| 7 | 1383 | 6.1% |
| 5 | 1382 | 6.1% |
| 9 | 1364 | 6.1% |
| 6 | 1354 | 6.0% |
| 3 | 1339 | 6.0% |
| 1 | 1316 | 5.8% |
| 8 | 1300 | 5.8% |
| 0 | 1287 | 5.7% |
| Other values (52) | 9000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4 | 1388 | 6.2% |
| 2 | 1387 | 6.2% |
| 7 | 1383 | 6.1% |
| 5 | 1382 | 6.1% |
| 9 | 1364 | 6.1% |
| 6 | 1354 | 6.0% |
| 3 | 1339 | 6.0% |
| 1 | 1316 | 5.8% |
| 8 | 1300 | 5.8% |
| 0 | 1287 | 5.7% |
| Other values (52) | 9000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4 | 1388 | 6.2% |
| 2 | 1387 | 6.2% |
| 7 | 1383 | 6.1% |
| 5 | 1382 | 6.1% |
| 9 | 1364 | 6.1% |
| 6 | 1354 | 6.0% |
| 3 | 1339 | 6.0% |
| 1 | 1316 | 5.8% |
| 8 | 1300 | 5.8% |
| 0 | 1287 | 5.7% |
| Other values (52) | 9000 |
ProcedureCode
Text
| Distinct | 4495 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.4 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 4491 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | iO013 |
|---|---|
| 2nd row | jR349 |
| 3rd row | uU479 |
| 4th row | Xs264 |
| 5th row | PV476 |
| Value | Count | Frequency (%) |
| zw098 | 3 | 0.1% |
| zf251 | 2 | < 0.1% |
| ty099 | 2 | < 0.1% |
| ze112 | 2 | < 0.1% |
| yg753 | 2 | < 0.1% |
| jw378 | 2 | < 0.1% |
| ln407 | 2 | < 0.1% |
| mt610 | 2 | < 0.1% |
| nr774 | 2 | < 0.1% |
| wq534 | 2 | < 0.1% |
| Other values (4475) | 4479 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 1419 | 6.3% |
| 9 | 1395 | 6.2% |
| 1 | 1385 | 6.2% |
| 3 | 1369 | 6.1% |
| 7 | 1368 | 6.1% |
| 0 | 1350 | 6.0% |
| 6 | 1344 | 6.0% |
| 4 | 1309 | 5.8% |
| 8 | 1296 | 5.8% |
| 2 | 1265 | 5.6% |
| Other values (52) | 9000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5 | 1419 | 6.3% |
| 9 | 1395 | 6.2% |
| 1 | 1385 | 6.2% |
| 3 | 1369 | 6.1% |
| 7 | 1368 | 6.1% |
| 0 | 1350 | 6.0% |
| 6 | 1344 | 6.0% |
| 4 | 1309 | 5.8% |
| 8 | 1296 | 5.8% |
| 2 | 1265 | 5.6% |
| Other values (52) | 9000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5 | 1419 | 6.3% |
| 9 | 1395 | 6.2% |
| 1 | 1385 | 6.2% |
| 3 | 1369 | 6.1% |
| 7 | 1368 | 6.1% |
| 0 | 1350 | 6.0% |
| 6 | 1344 | 6.0% |
| 4 | 1309 | 5.8% |
| 8 | 1296 | 5.8% |
| 2 | 1265 | 5.6% |
| Other values (52) | 9000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5 | 1419 | 6.3% |
| 9 | 1395 | 6.2% |
| 1 | 1385 | 6.2% |
| 3 | 1369 | 6.1% |
| 7 | 1368 | 6.1% |
| 0 | 1350 | 6.0% |
| 6 | 1344 | 6.0% |
| 4 | 1309 | 5.8% |
| 8 | 1296 | 5.8% |
| 2 | 1265 | 5.6% |
| Other values (52) | 9000 |
PatientAge
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.838444 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 45 |
| Zeros (%) | 1.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 25 |
| median | 50.5 |
| Q3 | 75 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 50 |
Descriptive statistics
| Standard deviation | 28.790471 |
|---|---|
| Coefficient of variation (CV) | 0.57767595 |
| Kurtosis | -1.2092364 |
| Mean | 49.838444 |
| Median Absolute Deviation (MAD) | 25.5 |
| Skewness | -0.02178574 |
| Sum | 224273 |
| Variance | 828.89121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 57 | 64 | 1.4% |
| 25 | 59 | 1.3% |
| 70 | 58 | 1.3% |
| 1 | 57 | 1.3% |
| 16 | 56 | 1.2% |
| 81 | 56 | 1.2% |
| 48 | 55 | 1.2% |
| 79 | 55 | 1.2% |
| 76 | 54 | 1.2% |
| 97 | 54 | 1.2% |
| Other values (90) | 3932 |
| Value | Count | Frequency (%) |
| 0 | 45 | |
| 1 | 57 | |
| 2 | 44 | |
| 3 | 39 | |
| 4 | 33 | |
| 5 | 38 | |
| 6 | 34 | |
| 7 | 33 | |
| 8 | 49 | |
| 9 | 51 |
| Value | Count | Frequency (%) |
| 99 | 46 | |
| 98 | 44 | |
| 97 | 54 | |
| 96 | 45 | |
| 95 | 40 | |
| 94 | 35 | |
| 93 | 40 | |
| 92 | 49 | |
| 91 | 45 | |
| 90 | 35 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | F |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 2282 | |
| M | 2218 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 2282 | |
| m | 2218 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 2282 | |
| M | 2218 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| F | 2282 | |
| M | 2218 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| F | 2282 | |
| M | 2218 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| F | 2282 | |
| M | 2218 |
ProviderSpecialty
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.6 KiB |
| Pediatrics | |
|---|---|
| Cardiology | |
| Orthopedics | |
| General Practice | |
| Neurology |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.179556 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Orthopedics |
|---|---|
| 2nd row | Cardiology |
| 3rd row | Cardiology |
| 4th row | Cardiology |
| 5th row | Neurology |
Common Values
| Value | Count | Frequency (%) |
| Pediatrics | 955 | |
| Cardiology | 907 | |
| Orthopedics | 893 | |
| General Practice | 880 | |
| Neurology | 865 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pediatrics | 955 | |
| cardiology | 907 | |
| orthopedics | 893 | |
| general | 880 | |
| practice | 880 | |
| neurology | 865 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 5380 | |
| e | 5353 | |
| i | 4590 | 9.1% |
| o | 4437 | 8.8% |
| a | 3622 | 7.2% |
| c | 3608 | 7.2% |
| d | 2755 | 5.5% |
| t | 2728 | 5.4% |
| l | 2652 | 5.3% |
| s | 1848 | 3.7% |
| Other values (12) | 13335 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 50308 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 5380 | |
| e | 5353 | |
| i | 4590 | 9.1% |
| o | 4437 | 8.8% |
| a | 3622 | 7.2% |
| c | 3608 | 7.2% |
| d | 2755 | 5.5% |
| t | 2728 | 5.4% |
| l | 2652 | 5.3% |
| s | 1848 | 3.7% |
| Other values (12) | 13335 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 50308 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 5380 | |
| e | 5353 | |
| i | 4590 | 9.1% |
| o | 4437 | 8.8% |
| a | 3622 | 7.2% |
| c | 3608 | 7.2% |
| d | 2755 | 5.5% |
| t | 2728 | 5.4% |
| l | 2652 | 5.3% |
| s | 1848 | 3.7% |
| Other values (12) | 13335 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 50308 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 5380 | |
| e | 5353 | |
| i | 4590 | 9.1% |
| o | 4437 | 8.8% |
| a | 3622 | 7.2% |
| c | 3608 | 7.2% |
| d | 2755 | 5.5% |
| t | 2728 | 5.4% |
| l | 2652 | 5.3% |
| s | 1848 | 3.7% |
| Other values (12) | 13335 |
ClaimStatus
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 246.2 KiB |
| Approved | |
|---|---|
| Denied | |
| Pending |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0022222 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pending |
|---|---|
| 2nd row | Denied |
| 3rd row | Pending |
| 4th row | Pending |
| 5th row | Approved |
Common Values
| Value | Count | Frequency (%) |
| Approved | 1522 | |
| Denied | 1512 | |
| Pending | 1466 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| approved | 1522 | |
| denied | 1512 | |
| pending | 1466 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6012 | |
| d | 4500 | |
| n | 4444 | |
| p | 3044 | |
| i | 2978 | |
| o | 1522 | 4.8% |
| A | 1522 | 4.8% |
| r | 1522 | 4.8% |
| v | 1522 | 4.8% |
| D | 1512 | 4.8% |
| Other values (2) | 2932 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 31510 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 6012 | |
| d | 4500 | |
| n | 4444 | |
| p | 3044 | |
| i | 2978 | |
| o | 1522 | 4.8% |
| A | 1522 | 4.8% |
| r | 1522 | 4.8% |
| v | 1522 | 4.8% |
| D | 1512 | 4.8% |
| Other values (2) | 2932 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 31510 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 6012 | |
| d | 4500 | |
| n | 4444 | |
| p | 3044 | |
| i | 2978 | |
| o | 1522 | 4.8% |
| A | 1522 | 4.8% |
| r | 1522 | 4.8% |
| v | 1522 | 4.8% |
| D | 1512 | 4.8% |
| Other values (2) | 2932 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 31510 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 6012 | |
| d | 4500 | |
| n | 4444 | |
| p | 3044 | |
| i | 2978 | |
| o | 1522 | 4.8% |
| A | 1522 | 4.8% |
| r | 1522 | 4.8% |
| v | 1522 | 4.8% |
| D | 1512 | 4.8% |
| Other values (2) | 2932 |
PatientIncome
Real number (ℝ)
High correlation Unique
| Distinct | 4500 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84384.284 |
| Minimum | 20006.87 |
|---|---|
| Maximum | 149957.52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.3 KiB |
Quantile statistics
| Minimum | 20006.87 |
|---|---|
| 5-th percentile | 26070.636 |
| Q1 | 52791.905 |
| median | 84061.205 |
| Q3 | 115768.42 |
| 95-th percentile | 142561.27 |
| Maximum | 149957.52 |
| Range | 129950.65 |
| Interquartile range (IQR) | 62976.513 |
Descriptive statistics
| Standard deviation | 37085.909 |
|---|---|
| Coefficient of variation (CV) | 0.43948834 |
| Kurtosis | -1.1707225 |
| Mean | 84384.284 |
| Median Absolute Deviation (MAD) | 31383.245 |
| Skewness | 0.015295257 |
| Sum | 3.7972928 × 108 |
| Variance | 1.3753646 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 131676.02 | 1 | < 0.1% |
| 57595.11 | 1 | < 0.1% |
| 140772.72 | 1 | < 0.1% |
| 69803.19 | 1 | < 0.1% |
| 138895.98 | 1 | < 0.1% |
| 96529.57 | 1 | < 0.1% |
| 28830.41 | 1 | < 0.1% |
| 111654.49 | 1 | < 0.1% |
| 20440.42 | 1 | < 0.1% |
| 131764.74 | 1 | < 0.1% |
| Other values (4490) | 4490 |
| Value | Count | Frequency (%) |
| 20006.87 | 1 | |
| 20031.31 | 1 | |
| 20031.58 | 1 | |
| 20053.34 | 1 | |
| 20093.19 | 1 | |
| 20102.64 | 1 | |
| 20117.76 | 1 | |
| 20122.6 | 1 | |
| 20166.98 | 1 | |
| 20278.32 | 1 |
| Value | Count | Frequency (%) |
| 149957.52 | 1 | |
| 149935.67 | 1 | |
| 149913.57 | 1 | |
| 149857.61 | 1 | |
| 149837.5 | 1 | |
| 149820.25 | 1 | |
| 149819.49 | 1 | |
| 149812.83 | 1 | |
| 149794.76 | 1 | |
| 149728.97 | 1 |
PatientMaritalStatus
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 246.2 KiB |
| Married | |
|---|---|
| Widowed | |
| Divorced | |
| Single |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0022222 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Single |
|---|---|
| 2nd row | Widowed |
| 3rd row | Married |
| 4th row | Married |
| 5th row | Divorced |
Common Values
| Value | Count | Frequency (%) |
| Married | 1181 | |
| Widowed | 1127 | |
| Divorced | 1101 | |
| Single | 1091 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| married | 1181 | |
| widowed | 1127 | |
| divorced | 1101 | |
| single | 1091 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 4536 | |
| i | 4500 | |
| e | 4500 | |
| r | 3463 | |
| o | 2228 | 7.1% |
| a | 1181 | 3.7% |
| M | 1181 | 3.7% |
| W | 1127 | 3.6% |
| w | 1127 | 3.6% |
| D | 1101 | 3.5% |
| Other values (6) | 6566 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 31510 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 4536 | |
| i | 4500 | |
| e | 4500 | |
| r | 3463 | |
| o | 2228 | 7.1% |
| a | 1181 | 3.7% |
| M | 1181 | 3.7% |
| W | 1127 | 3.6% |
| w | 1127 | 3.6% |
| D | 1101 | 3.5% |
| Other values (6) | 6566 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 31510 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 4536 | |
| i | 4500 | |
| e | 4500 | |
| r | 3463 | |
| o | 2228 | 7.1% |
| a | 1181 | 3.7% |
| M | 1181 | 3.7% |
| W | 1127 | 3.6% |
| w | 1127 | 3.6% |
| D | 1101 | 3.5% |
| Other values (6) | 6566 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 31510 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 4536 | |
| i | 4500 | |
| e | 4500 | |
| r | 3463 | |
| o | 2228 | 7.1% |
| a | 1181 | 3.7% |
| M | 1181 | 3.7% |
| W | 1127 | 3.6% |
| w | 1127 | 3.6% |
| D | 1101 | 3.5% |
| Other values (6) | 6566 |
PatientEmploymentStatus
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 250.7 KiB |
| Employed | |
|---|---|
| Unemployed | |
| Student | |
| Retired |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.0246667 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Employed |
|---|---|
| 2nd row | Employed |
| 3rd row | Student |
| 4th row | Employed |
| 5th row | Unemployed |
Common Values
| Value | Count | Frequency (%) |
| Employed | 1188 | |
| Unemployed | 1141 | |
| Student | 1110 | |
| Retired | 1061 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| employed | 1188 | |
| unemployed | 1141 | |
| student | 1110 | |
| retired | 1061 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6702 | |
| d | 4500 | |
| t | 3281 | |
| m | 2329 | 6.4% |
| y | 2329 | 6.4% |
| l | 2329 | 6.4% |
| o | 2329 | 6.4% |
| p | 2329 | 6.4% |
| n | 2251 | 6.2% |
| E | 1188 | 3.3% |
| Other values (6) | 6544 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 36111 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 6702 | |
| d | 4500 | |
| t | 3281 | |
| m | 2329 | 6.4% |
| y | 2329 | 6.4% |
| l | 2329 | 6.4% |
| o | 2329 | 6.4% |
| p | 2329 | 6.4% |
| n | 2251 | 6.2% |
| E | 1188 | 3.3% |
| Other values (6) | 6544 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 36111 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 6702 | |
| d | 4500 | |
| t | 3281 | |
| m | 2329 | 6.4% |
| y | 2329 | 6.4% |
| l | 2329 | 6.4% |
| o | 2329 | 6.4% |
| p | 2329 | 6.4% |
| n | 2251 | 6.2% |
| E | 1188 | 3.3% |
| Other values (6) | 6544 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 36111 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 6702 | |
| d | 4500 | |
| t | 3281 | |
| m | 2329 | 6.4% |
| y | 2329 | 6.4% |
| l | 2329 | 6.4% |
| o | 2329 | 6.4% |
| p | 2329 | 6.4% |
| n | 2251 | 6.2% |
| E | 1188 | 3.3% |
| Other values (6) | 6544 |
ProviderLocation
Text
| Distinct | 3876 |
|---|---|
| Distinct (%) | 86.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 268.4 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 12.056222 |
| Min length | 6 |
Unique
| Unique | 3417 ? |
|---|---|
| Unique (%) | 75.9% |
Sample
| 1st row | New Alishaview |
|---|---|
| 2nd row | East Curtis |
| 3rd row | Lake Jennifer |
| 4th row | Martinstad |
| 5th row | Thomasfurt |
| Value | Count | Frequency (%) |
| east | 336 | 5.0% |
| north | 333 | 4.9% |
| south | 330 | 4.9% |
| lake | 324 | 4.8% |
| west | 317 | 4.7% |
| port | 304 | 4.5% |
| new | 298 | 4.4% |
| michael | 32 | 0.5% |
| jennifer | 20 | 0.3% |
| james | 20 | 0.3% |
| Other values (3077) | 4428 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5232 | 9.6% |
| t | 4293 | 7.9% |
| a | 4276 | 7.9% |
| r | 4221 | 7.8% |
| o | 3682 | 6.8% |
| h | 3002 | 5.5% |
| n | 2964 | 5.5% |
| i | 2682 | 4.9% |
| s | 2615 | 4.8% |
| 2242 | 4.1% | |
| Other values (40) | 19044 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 54253 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5232 | 9.6% |
| t | 4293 | 7.9% |
| a | 4276 | 7.9% |
| r | 4221 | 7.8% |
| o | 3682 | 6.8% |
| h | 3002 | 5.5% |
| n | 2964 | 5.5% |
| i | 2682 | 4.9% |
| s | 2615 | 4.8% |
| 2242 | 4.1% | |
| Other values (40) | 19044 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 54253 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5232 | 9.6% |
| t | 4293 | 7.9% |
| a | 4276 | 7.9% |
| r | 4221 | 7.8% |
| o | 3682 | 6.8% |
| h | 3002 | 5.5% |
| n | 2964 | 5.5% |
| i | 2682 | 4.9% |
| s | 2615 | 4.8% |
| 2242 | 4.1% | |
| Other values (40) | 19044 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 54253 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5232 | 9.6% |
| t | 4293 | 7.9% |
| a | 4276 | 7.9% |
| r | 4221 | 7.8% |
| o | 3682 | 6.8% |
| h | 3002 | 5.5% |
| n | 2964 | 5.5% |
| i | 2682 | 4.9% |
| s | 2615 | 4.8% |
| 2242 | 4.1% | |
| Other values (40) | 19044 |
ClaimType
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.9 KiB |
| Outpatient | |
|---|---|
| Routine | |
| Inpatient | |
| Emergency |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.7453333 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Inpatient |
|---|---|
| 2nd row | Inpatient |
| 3rd row | Emergency |
| 4th row | Routine |
| 5th row | Inpatient |
Common Values
| Value | Count | Frequency (%) |
| Outpatient | 1152 | |
| Routine | 1149 | |
| Inpatient | 1128 | |
| Emergency | 1071 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| outpatient | 1152 | |
| routine | 1149 | |
| inpatient | 1128 | |
| emergency | 1071 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6861 | |
| n | 5628 | |
| e | 5571 | |
| i | 3429 | |
| u | 2301 | 5.8% |
| a | 2280 | 5.8% |
| p | 2280 | 5.8% |
| O | 1152 | 2.9% |
| R | 1149 | 2.9% |
| o | 1149 | 2.9% |
| Other values (7) | 7554 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 39354 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 6861 | |
| n | 5628 | |
| e | 5571 | |
| i | 3429 | |
| u | 2301 | 5.8% |
| a | 2280 | 5.8% |
| p | 2280 | 5.8% |
| O | 1152 | 2.9% |
| R | 1149 | 2.9% |
| o | 1149 | 2.9% |
| Other values (7) | 7554 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 39354 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 6861 | |
| n | 5628 | |
| e | 5571 | |
| i | 3429 | |
| u | 2301 | 5.8% |
| a | 2280 | 5.8% |
| p | 2280 | 5.8% |
| O | 1152 | 2.9% |
| R | 1149 | 2.9% |
| o | 1149 | 2.9% |
| Other values (7) | 7554 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 39354 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 6861 | |
| n | 5628 | |
| e | 5571 | |
| i | 3429 | |
| u | 2301 | 5.8% |
| a | 2280 | 5.8% |
| p | 2280 | 5.8% |
| O | 1152 | 2.9% |
| R | 1149 | 2.9% |
| o | 1149 | 2.9% |
| Other values (7) | 7554 |
ClaimSubmissionMethod
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 238.9 KiB |
| Paper | |
|---|---|
| Phone | |
| Online |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.3246667 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Paper |
|---|---|
| 2nd row | Online |
| 3rd row | Online |
| 4th row | Phone |
| 5th row | Phone |
Common Values
| Value | Count | Frequency (%) |
| Paper | 1544 | |
| Phone | 1495 | |
| Online | 1461 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| paper | 1544 | |
| phone | 1495 | |
| online | 1461 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4500 | |
| n | 4417 | |
| P | 3039 | |
| p | 1544 | 6.4% |
| a | 1544 | 6.4% |
| r | 1544 | 6.4% |
| h | 1495 | 6.2% |
| o | 1495 | 6.2% |
| O | 1461 | 6.1% |
| l | 1461 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23961 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 4500 | |
| n | 4417 | |
| P | 3039 | |
| p | 1544 | 6.4% |
| a | 1544 | 6.4% |
| r | 1544 | 6.4% |
| h | 1495 | 6.2% |
| o | 1495 | 6.2% |
| O | 1461 | 6.1% |
| l | 1461 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23961 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 4500 | |
| n | 4417 | |
| P | 3039 | |
| p | 1544 | 6.4% |
| a | 1544 | 6.4% |
| r | 1544 | 6.4% |
| h | 1495 | 6.2% |
| o | 1495 | 6.2% |
| O | 1461 | 6.1% |
| l | 1461 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23961 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 4500 | |
| n | 4417 | |
| P | 3039 | |
| p | 1544 | 6.4% |
| a | 1544 | 6.4% |
| r | 1544 | 6.4% |
| h | 1495 | 6.2% |
| o | 1495 | 6.2% |
| O | 1461 | 6.1% |
| l | 1461 | 6.1% |
Cluster
Categorical
High correlation
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 219.9 KiB |
| 3 | |
|---|---|
| 0 | |
| 2 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 2 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 1152 | |
| 0 | 1144 | |
| 2 | 1104 | |
| 1 | 1100 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 1152 | |
| 0 | 1144 | |
| 2 | 1104 | |
| 1 | 1100 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1152 | |
| 0 | 1144 | |
| 2 | 1104 | |
| 1 | 1100 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 1152 | |
| 0 | 1144 | |
| 2 | 1104 | |
| 1 | 1100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 1152 | |
| 0 | 1144 | |
| 2 | 1104 | |
| 1 | 1100 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 1152 | |
| 0 | 1144 | |
| 2 | 1104 | |
| 1 | 1100 |
ClaimLegitimacy
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 258.1 KiB |
| Legitimate | |
|---|---|
| Fraud | 270 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.7 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Legitimate |
|---|---|
| 2nd row | Legitimate |
| 3rd row | Legitimate |
| 4th row | Legitimate |
| 5th row | Legitimate |
Common Values
| Value | Count | Frequency (%) |
| Legitimate | 4230 | |
| Fraud | 270 | 6.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| legitimate | 4230 | |
| fraud | 270 | 6.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8460 | |
| t | 8460 | |
| i | 8460 | |
| a | 4500 | |
| L | 4230 | |
| g | 4230 | |
| m | 4230 | |
| F | 270 | 0.6% |
| r | 270 | 0.6% |
| u | 270 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43650 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 8460 | |
| t | 8460 | |
| i | 8460 | |
| a | 4500 | |
| L | 4230 | |
| g | 4230 | |
| m | 4230 | |
| F | 270 | 0.6% |
| r | 270 | 0.6% |
| u | 270 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43650 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 8460 | |
| t | 8460 | |
| i | 8460 | |
| a | 4500 | |
| L | 4230 | |
| g | 4230 | |
| m | 4230 | |
| F | 270 | 0.6% |
| r | 270 | 0.6% |
| u | 270 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43650 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 8460 | |
| t | 8460 | |
| i | 8460 | |
| a | 4500 | |
| L | 4230 | |
| g | 4230 | |
| m | 4230 | |
| F | 270 | 0.6% |
| r | 270 | 0.6% |
| u | 270 | 0.6% |
Interactions
Correlations
| ClaimAmount | ClaimLegitimacy | ClaimStatus | ClaimSubmissionMethod | ClaimType | Cluster | PatientAge | PatientEmploymentStatus | PatientGender | PatientIncome | PatientMaritalStatus | ProviderSpecialty | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ClaimAmount | 1.000 | 0.406 | 0.000 | 0.006 | 0.044 | 0.022 | 0.009 | 0.007 | 0.022 | 0.019 | 0.014 | 0.017 |
| ClaimLegitimacy | 0.406 | 1.000 | 0.000 | 0.000 | 0.000 | 0.437 | 0.030 | 0.013 | 0.009 | 0.400 | 0.000 | 0.000 |
| ClaimStatus | 0.000 | 0.000 | 1.000 | 0.014 | 0.000 | 0.005 | 0.016 | 0.013 | 0.000 | 0.000 | 0.018 | 0.024 |
| ClaimSubmissionMethod | 0.006 | 0.000 | 0.014 | 1.000 | 0.009 | 0.000 | 0.011 | 0.000 | 0.000 | 0.026 | 0.023 | 0.018 |
| ClaimType | 0.044 | 0.000 | 0.000 | 0.009 | 1.000 | 0.023 | 0.000 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000 |
| Cluster | 0.022 | 0.437 | 0.005 | 0.000 | 0.023 | 1.000 | 0.029 | 0.000 | 0.004 | 0.920 | 0.000 | 0.000 |
| PatientAge | 0.009 | 0.030 | 0.016 | 0.011 | 0.000 | 0.029 | 1.000 | 0.000 | 0.000 | 0.017 | 0.000 | 0.014 |
| PatientEmploymentStatus | 0.007 | 0.013 | 0.013 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| PatientGender | 0.022 | 0.009 | 0.000 | 0.000 | 0.000 | 0.004 | 0.000 | 0.000 | 1.000 | 0.028 | 0.017 | 0.018 |
| PatientIncome | 0.019 | 0.400 | 0.000 | 0.026 | 0.026 | 0.920 | 0.017 | 0.000 | 0.028 | 1.000 | 0.008 | 0.000 |
| PatientMaritalStatus | 0.014 | 0.000 | 0.018 | 0.023 | 0.000 | 0.000 | 0.000 | 0.000 | 0.017 | 0.008 | 1.000 | 0.000 |
| ProviderSpecialty | 0.017 | 0.000 | 0.024 | 0.018 | 0.000 | 0.000 | 0.014 | 0.000 | 0.018 | 0.000 | 0.000 | 1.000 |
Missing values
Sample
| ClaimID | PatientID | ProviderID | ClaimAmount | ClaimDate | DiagnosisCode | ProcedureCode | PatientAge | PatientGender | ProviderSpecialty | ClaimStatus | PatientIncome | PatientMaritalStatus | PatientEmploymentStatus | ProviderLocation | ClaimType | ClaimSubmissionMethod | Cluster | ClaimLegitimacy | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 4d76c7f7-d36a-4139-b451-a9a4ad10d7d5 | 19cf2638-3ec0-4ed9-9995-d9ba4553813a | a3d0cc80-dffe-40ff-a302-23c8ffeedb36 | 7820.52 | 2024-07-08 | Ta150 | iO013 | 96 | F | Orthopedics | Pending | 57595.11 | Single | Employed | New Alishaview | Inpatient | Paper | 3 | Legitimate |
| 1 | e35193b4-3609-492b-866a-98de19317e9c | 5c4bb6c5-4dd3-4a86-85fa-f36c0d8debff | a9f25acf-92b8-45e2-9cef-87bd07d0a591 | 5453.86 | 2024-07-08 | Fo766 | jR349 | 95 | M | Cardiology | Denied | 140772.72 | Widowed | Employed | East Curtis | Inpatient | Online | 2 | Legitimate |
| 2 | 1f3fa373-25ed-4ff4-b6c7-38dcb2fb297f | 777866e0-4d10-45a8-a7b4-dbdaa26d5a81 | 951b1e08-9948-4956-80e5-9277f16bd290 | 8229.86 | 2024-07-08 | AX876 | uU479 | 10 | M | Cardiology | Pending | 69803.19 | Married | Student | Lake Jennifer | Emergency | Online | 3 | Legitimate |
| 3 | af6a68f4-8319-47b1-a28b-77de01572851 | 9d7c53ee-eb1a-4f07-9e3a-e86cf82e9f0f | de9e193a-f9a1-4d63-9345-aefe75694628 | 9519.16 | 2024-07-08 | SQ441 | Xs264 | 59 | F | Cardiology | Pending | 135530.12 | Married | Employed | Martinstad | Routine | Phone | 2 | Legitimate |
| 4 | 417fe944-79d2-4610-81c4-a2d496f29ee4 | db14b0ca-ac2a-4e83-b085-947ea32e7587 | 5c7d7045-71b6-4c15-937c-43e4cfe65bf4 | 3226.15 | 2024-07-08 | FK970 | PV476 | 36 | F | Neurology | Approved | 36995.52 | Divorced | Unemployed | Thomasfurt | Inpatient | Phone | 1 | Legitimate |
| 5 | 41c69c3f-7b63-435c-841f-97633264a347 | 9caba0e6-334d-4132-9330-1c1adaa82d11 | 11ff25ad-29c9-493b-a356-cb0c6a8f41a6 | 3476.56 | 2024-07-07 | ZE958 | Am159 | 26 | F | Cardiology | Denied | 96819.09 | Divorced | Retired | North Michael | Outpatient | Paper | 0 | Legitimate |
| 6 | 80a92d69-9d51-476c-8d1d-0ea35a7081a9 | c4daf0c4-8d67-4aba-97db-442a948db4d3 | 19d62078-bb03-4473-8815-5f814c12b5c8 | 6468.55 | 2024-07-07 | hg131 | vm240 | 3 | M | Neurology | Denied | 117271.04 | Married | Employed | West Paul | Emergency | Paper | 2 | Legitimate |
| 7 | 31c804c9-110c-4c26-bf38-2638b9e29526 | 71d7c4ac-c608-4392-8f71-83ce85d00595 | c1bfab96-0df6-4a49-96e1-236bd3c6a7b5 | 280.40 | 2024-07-07 | Xa559 | eD733 | 99 | M | Pediatrics | Denied | 125318.21 | Widowed | Employed | Ambermouth | Inpatient | Paper | 2 | Legitimate |
| 8 | 25d801f8-d141-4131-9f1f-0c63360b4302 | 919f254e-a7eb-41da-8f14-b2ee11aad6da | 8d1a5376-5ea6-42e6-beec-2c1313a30a49 | 4661.71 | 2024-07-07 | Sj663 | uq058 | 57 | F | Neurology | Pending | 24263.98 | Widowed | Employed | Larsonville | Inpatient | Paper | 1 | Legitimate |
| 9 | 8b5172a0-9aab-439a-9e3f-d0af5f2a1b6b | 6d707925-803e-42b8-af6e-e77d9f45dd8a | ed55392c-f9e3-469c-9367-00df827b1cf6 | 9638.64 | 2024-07-07 | Qu671 | Gw549 | 91 | M | Pediatrics | Approved | 78191.10 | Widowed | Unemployed | South Jessicabury | Outpatient | Phone | 3 | Legitimate |
| ClaimID | PatientID | ProviderID | ClaimAmount | ClaimDate | DiagnosisCode | ProcedureCode | PatientAge | PatientGender | ProviderSpecialty | ClaimStatus | PatientIncome | PatientMaritalStatus | PatientEmploymentStatus | ProviderLocation | ClaimType | ClaimSubmissionMethod | Cluster | ClaimLegitimacy | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4490 | a996469c-9d91-437d-9a13-33384ec86e27 | 6c35f381-e4fc-4d5b-a4e4-fc8abeed154a | 04a1cc5f-73cc-4bca-b6f0-af3fc2529ec4 | 5879.35 | 2022-07-10 | FC909 | jB020 | 84 | F | Cardiology | Denied | 129931.42 | Divorced | Unemployed | Jacobsberg | Outpatient | Paper | 2 | Legitimate |
| 4491 | 6c85c4f1-bc4b-46fb-a573-3fea7853cb38 | 05ec1ef6-bef2-43af-befa-c00eb68af7a9 | 67cd32ac-9518-417a-aa8e-bc46911951e4 | 6250.80 | 2022-07-10 | aW066 | su896 | 91 | M | General Practice | Pending | 43191.77 | Married | Employed | Lake Edwardmouth | Routine | Phone | 1 | Legitimate |
| 4492 | 4e2838d1-2819-441f-a435-97c22b9f4e8b | d708e793-c837-40e4-8d4c-852a94b4e87f | 3fc3e1bd-a685-4a9e-8014-1d81a6eaffe5 | 8290.29 | 2022-07-10 | tl190 | mO870 | 22 | M | Orthopedics | Denied | 86328.07 | Widowed | Retired | Garcialand | Inpatient | Phone | 0 | Legitimate |
| 4493 | 8a2ffe23-6145-49b3-89a4-1013c4e858e2 | 516ea776-3d69-4c5f-a9d2-bae1698b28d2 | 2fcfedff-0300-4c6d-82ba-cb4858c7487f | 9102.27 | 2022-07-09 | ZQ868 | GC202 | 0 | M | Orthopedics | Approved | 48185.86 | Married | Student | South Anthonyfurt | Emergency | Phone | 1 | Fraud |
| 4494 | 4c4e4abc-e65d-485e-9882-c44485e63917 | f3697794-b8d7-4c0d-a18c-72e5cab95d95 | 98f91962-bcf3-482b-8ea7-f003d74c86ae | 1189.51 | 2022-07-09 | Ux531 | bJ956 | 68 | F | Pediatrics | Denied | 108225.81 | Married | Employed | Herreraborough | Routine | Paper | 0 | Legitimate |
| 4495 | 6c427360-20ae-43b8-802f-bd25fae3ce09 | c0ddd919-1b16-4689-9963-7566ba410835 | c0039b67-ace3-4f97-a646-4214419f9fdf | 3041.50 | 2022-07-09 | qJ110 | bn806 | 10 | M | General Practice | Denied | 80395.76 | Widowed | Student | New Melissastad | Emergency | Paper | 3 | Legitimate |
| 4496 | 43b72c25-94ae-4f1f-a2fb-cb3297978674 | 02ea4377-cf98-4251-a1d6-8eb720d903d8 | 2dcbfa56-e73a-42b5-bfbf-02bfb9b3f990 | 5153.28 | 2022-07-09 | dc670 | wX329 | 96 | F | Neurology | Pending | 31560.84 | Widowed | Retired | Lake Cathymouth | Outpatient | Phone | 1 | Legitimate |
| 4497 | e0bf8e55-7440-48bb-9583-187ab12a5682 | 14844cfb-2bff-4be5-8540-7d58c72ed309 | ae3fdf78-c574-495a-ba8a-2246ba1d61a5 | 6908.45 | 2022-07-09 | cF152 | aT402 | 97 | F | Pediatrics | Denied | 74973.94 | Married | Unemployed | Garyborough | Inpatient | Online | 3 | Legitimate |
| 4498 | 1a3f947a-f3a7-4286-8925-aed2eced6ee2 | cfedbf0b-43eb-4dbe-a26b-74bd566898c8 | d344683d-f2e2-4262-8c04-f9e92fda1d33 | 5830.19 | 2022-07-09 | Sc398 | wv342 | 14 | F | General Practice | Approved | 147665.80 | Widowed | Student | East Claudiafurt | Routine | Paper | 2 | Legitimate |
| 4499 | 291cfa64-9956-40e7-b89f-4628650f42f0 | 2bd2d173-4ce1-428d-836c-259d9236a839 | cf84cf99-0ac3-465a-af90-239a873bafa5 | 5848.92 | 2022-07-09 | TQ972 | Sn273 | 47 | M | Pediatrics | Approved | 131676.02 | Married | Unemployed | North Amberborough | Inpatient | Phone | 2 | Legitimate |